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RAW SEQUENCE LISTING DATE: 07/16/2004 

PATENT APPLICATION: US/10/501, 071 TIME: 16:37:40 

Input Set : A;\Sequence Listing.ST25.txt 
Output Set: N:\CRF4\07162004\J501071.raw 

5 <110> APPLICANT: University of Newcastle Upon Tyne 
9 <120> TITLE OF INVENTION: Fusion Proteins 

13 <130> FILE REFERENCE: 43952/JMD/MAR 
C--> 17 <140> CURRENT APPLICATION NUMBER: US/10/501 # 071 
C--> 17 <141> CURRENT FILING DATE: 2004-07-09 

17 <150> PRIOR APPLICATION NUMBER: GB 0200689.8 

19 <151> PRIOR FILING DATE: 2002-01-10 

23 <160> NUMBER OF SEQ ID NOS : 61 

27 <170> SOFTWARE: Patentln version 3.1 

31 <210> SEQ ID NO: 1 

33 <211> LENGTH: 9 

35 <212> TYPE: PRT 

37 <213> ORGANISM: Artificial Sequence 
41 <220> FEATURE: 

43 <223> OTHER INFORMATION: Ala3-His6 tail 
45 <400> SEQUENCE: 1 

47 Ala Ala Ala His His His His His His 

48 1 5 
51 <210> SEQ ID NO: 2 
53 <211> LENGTH: 25 
55 <212> TYPE: PRT 

57 <213> ORGANISM: Escherichia coli 
61 <400> SEQUENCE: 2 

63 Met Asn Met Lys Lys Leu Ala Thr Leu Val Ser Ala Val Ala Leu Ser 

64 1 5 10 15 

67 Ala Thr Val Ser Ala Asn Ala Met Ala 

68 20 25 
71 <210> SEQ ID NO: 3 
73 <211> LENGTH: 5 
75 <212> TYPE: PRT 

77 <213> ORGANISM: Artificial Sequence 
81 <22 0> FEATURE: 

83 <223> OTHER INFORMATION: Cleavage site for enterokinase 
85 <400> SEQUENCE: 3 

87 Asp Asp Asp Asp Lys 

88 1 5 
91 <210> SEQ ID NO: 4 
93 <211> LENGTH: 4 
95 <212> TYPE: PRT 

97 <213> ORGANISM: Artificial Sequence 
101 <220> FEATURE: 

103 <223> OTHER INFORMATION: Cleavage site for thrombin 
105 <400> SEQUENCE: 4 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/10/501 , 071 



DATE: 07/16/2004 
TIME: 16:37:40 



Input Set : A:\Sequence Listing.ST25.txt 
Output Set: N:\CRF4\07162004\J501071.raw 



107 Leu Val Pro Arg 

108 1 

111 <210> SEQ ID NO: 5 
113 <211> LENGTH: 4 
115 <212> TYPE: PRT 

117 <213> ORGANISM: Artificial Sequence 
121 <220> FEATURE: 

123 <223> OTHER INFORMATION: Cleavage site for factor Xa 
125 <400> SEQUENCE: 5 

127 He Glu Gly Arg 

128 1 

131 <210> SEQ ID NO: 6 
133 <211> LENGTH: 4 
135 <212> TYPE: PRT 

137 <213> ORGANISM: Artificial Sequence 
141 <220> FEATURE: 

143 <223> OTHER INFORMATION: 4xHis tag 
145 <400> SEQUENCE: 6 

147 His His His His 

148 1 

151 <210> SEQ ID NO: 7 
153 <211> LENGTH: 5 
155 <212> TYPE: PRT 

157 <213> ORGANISM: Artificial Sequence 
161 <220> FEATURE: 

163 <223> OTHER INFORMATION: 5xHis tag 
165 <400> SEQUENCE: 7 

167 His His His His His 

168 1 5 
171 <210> SEQ ID NO: 8 
173 <211> LENGTH: 6 
175 <212> TYPE: PRT 

177 <213> ORGANISM: Artificial Sequence 
181 <220> FEATURE: 

183 <223> OTHER INFORMATION: 6xHis tag 
185 <400> SEQUENCE: 8 

187 His His His His His His 

188 1 5 
191 <210> SEQ ID NO: 9 
193 <211> LENGTH: 7 
195 <212> TYPE: PRT 

197 <213> ORGANISM: Artificial Sequence 
201 <220> FEATURE: 

203 <223> OTHER INFORMATION: 7xHis tag 
205 <400> SEQUENCE: 9 

207 His His His His His His His 

208 1 5 
211 <210> SEQ ID NO: 10 
213 <211> LENGTH: 8 
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215 <212> TYPE: PRT 

217 <213> ORGANISM: Artificial Sequence 
221 <220> FEATURE: 

223 <223> OTHER INFORMATION: 8xHis tag 
225 <400> SEQUENCE: 10 

227 His His His His His His His His 

228 1 5 
231 <210> SEQ ID NO: 11 
233 <211> LENGTH: 9 
235 <212> TYPE: PRT 

237 <213> ORGANISM: Artificial Sequence 
241 <220> FEATURE: 

243 <223> OTHER INFORMATION: 9xHis tag 
245 <400> SEQUENCE: 11 

247 His His His His His His His His His 

248 1 5 
251 <210> SEQ ID NO: 12 
253 <211> LENGTH: 10 
255 <212> TYPE: PRT 

257 <213> ORGANISM: Artificial Sequence 
261 <220> FEATURE: 

263 <223> OTHER INFORMATION: lOxHis tag 
265 <400> SEQUENCE: 12 

267 His His His His His His His His His His 

268 15 10 
271 <210> SEQ ID NO: 13 

273 <211> LENGTH: 93 
275 <212> TYPE: PRT 

277 <213> ORGANISM: Escherichia coli 
281 <400> SEQUENCE: 13 



283 


Asn 


Asn 


Gly 


Ala 


Ser 


Gly 


Ala 


Asp 


He Asn 


Asn 


Tyr 


Ala Gly Gin 


He 


284 


1 








5 








10 






15 




287 


Lys 


Ser 


Ala 


He 


Glu 


Ser 


Lys 


Phe 


Tyr Asp Ala 


Ser 


Ser Tyr Ala Gly 


288 








20 










25 






30 




291 


Lys 


Thr 


Cys 


Thr 


Leu 


Arg 


He 


Lys 


Leu Ala 


Pro Asp 


Gly Met Leu 


Leu 


292 






35 










40 








45 




295 


Asp 


He 


Lys 


Pro 


Glu 


Gly 


Gly 


Asp 


Pro Ala 


Leu 


Cys 


Gin Ala Ala 


Leu 


296 




50 










55 








60 






299 


Ala 


Ala 


Ala 


Lys 


Leu 


Ala 


Lys 


He 


Pro Lys 


Pro 


Pro 


Ser Gin Ala 


Val 


300 


65 










70 








75 






80 


303 


Tyr 


Glu 


Val 


Phe 


Lys 


Asn 


Ala 


Pro 


Leu Asp 


Phe 


Lys 


Pro 





304 85 90 

307 <210> SEQ ID NO: 14 

309 <211> LENGTH: 348 

311 <212> TYPE: PRT 

313 <213> ORGANISM: Artificial Sequence 

317 <220> FEATURE: 

319 <223> OTHER INFORMATION: TolA-BCL fusion protein 

321 <400> SEQUENCE: 14 
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323 


nc Lr n x o 11 X 0 


His 


His 


His 


His 




Ser 


Asn 


Asn 


Glv 


Ala 


Ser 


Glv 
oxy 


Ala 


324 


1 




5 










10 










15 




^97 


Agti Tl tf» Ron 
nop 11C noli 


AOll 


r PVY* 


Al a 


m v 

0 x y 




He 

x x e 


Lys 


oe x 


Ala 


He 


Glu 


OCX 


xiy 0 


328 




2 0 










25 










30 






7 71 

j ji 


ir I1C 1 y i. nop 


AT a. 

nl ct 


iJCl 


Cpy" 


iyr 


Al a. 


oxy 


xjy 0 


X 111 


Jr O 


X 111. 


xje LL 




X X c 


332 


35 










40 










45 








"\ "X r 


T ire T 01 1 Si a 
iiyb lieu Mxci 


Pro 


nop 


m \7 

oiy 


l y icL 




T,oi 1 
XICU 


A CT> 

nop 


x 1c 


Xjy 0 


XT X W 


Ol Li 


oxy 


ox y 


JJU 


ro 
0 u 








R R 
O O 










fiO 












nSp riO Ala 


Leu 


Cys 


m n 


Al a 


AT a 
i-ilci 


T 01 1 
xieU 


AT = 


AT a 
nld 


AT = 


iiy 0 


T .01 1 
XjC U. 


AT a 
nXa 


Xjy 0 


O ft U 


£ R 
0 O 






/ U 










7 R 
/ O 










ft 0 




lie Fro ±jys 


Pro 


Pro 


Ser 


01 -n 

bin 


AT a 

Aia 


vai 


Tyr 


bill 


V O.X 


rile 


Lys 


TV 

Asn 


A 1 -a 
i-ilcl 


"3 A A 
J ft ft 






Q R 
OO 










on 










QR 
^ 0 




Oft / 


Piro Leu Asp 


riie 


Lys 


Pro 


biy 


biy 


Kj±y 


Ser 


biy 


Ser 


Leu 


Veil 


Pro 


Arg 


oft o 




ion 

1UU 










1 O R 
x U D 










110 
11U 






J j1 


Gly Ser Arg 


Pro 


Ser 


bin 


Ser 


Asn 


Arg 


bill 


Leu 


vai 


v a i 
vai 


Asp 


fne 


Leu 


"3 R9 


1 1 c 
11 0 










19 0 










IOC 
X -> 








T C C 

355 


Ser Tyr Lys 


Leu 


Ser 


bin 


Lys 


biy 


Tyr 


Ser 


Trp 


Ser 


bin 


Fne 


Ser 


Asp 


ODD 


1 "5 O 








loo 










1 A 0 
lffc u 










"D C O 


vai biu biu 


Asn 


Arg 


inr 


Pi 11 
blU 


A 1 -j 

Aia 


Pro 


r*i 11 

bill 


Pi tr 

biy 


inr 


blU 


Ser 


bill 


l v le L. 


JbU 


lflO 






1 RO 










1 RR 
x O O 










1^0 

X D U 


"3 c o 
JDJ 


Glu Thr Pro 


Ser 


Ala 


Ti- 
ne 


Asn 


(jiy 


Asn 


Pro 


Ser 


Trp 


TT J 

HIS 


Leu 


Ala 

Aia 


Asp 


O Oft 






ice 










1 *70 










1 7R 
I/O 




-SO / 


Ser Pro Ala 


vai 


Asn 


biy 


Ala 

Aia 


Thr 


A 1 =a 

Aia 


rllS 


Ser 


Ser 


Ser 


Leu 


Asp 


A 1 -a 

Aia 


"3 C Q 




loll 










1 Q R 

loo 










1 QO 
13U 






J /l 


Arg biu vai 


Tin 

lie 


Pro 


Mec 


Aia 


Ala 

Aia 


vai 


Lys 


bin 


Ala 

Aia 


Leu 


Arg 


blU 


Al a 

Aia 


O 












900 










9 0 R 

£t \J O 








•57c 
O /O 


biy Asp blU 


Fne 


blU 


Leu 


Arg 


Tyr 


Arg 


Arg 


A 1 

Aia 


riie 


Ser 


Asp 


T 

Leu 


1 11 X 


*5 7£ 


91 0 








9 1 R 
Z 1 O 










990 

j!L Z> \J 










•J7Q 

0 / y 


ber bin Lieu 


nlo 


Tl 0 

lie 


Thr 


Pro 


r*1 

vjiy 


Tnr 


A 1 =3 

Ala 


Tyr 


bin 


Ser 


DV10 
file 


Ol Li 


Olii 


7 AH 


z o 






9 ^ 0 










9 1R 










240 


JOJ 


Val val noil 


bi u. 


Leu 


file 


Arg 


Asp 


rji tt 


V Ct X 


A cn 
noil 


irp 


oxy 


ril y 


11c 


V Ct X 


J 0 *± 






94R 










250 










255 




1fi7 

JO / 


nlu file lr lie 


Co y 


xriie 


ui y 


m v 


AT = 


T .<=n 1 

XjC Li 


jf O 


Val 

v 0. x 


OX Li 


0 e x 


Val 


A qn 
nop 


Lys 


"}ftft 

000 




260 










265 










270 






J .7 J. 


Glu Met Gin 


Val 


Leu 


Val 


0 ex 




lie 

x x e 


Ala 


Ala 


Tyn 
lip 


Met 


Ala 


Thr 


TVr 
xy x 


0 j ^ 


275 










2 80 










285 








395 


Leu Asn Asp 


His 


Leu 


Glu 


Pro 


Trp 


He 


Gin 


Glu 


Asn 


Glv 


Glv 

oxy 


Tro 

i xp 


Asp 


O 27 O 


290 








295 










"^00 

0 u u 










399 


Thr Phe Val 


Glu 


Leu 


Tyr 


Gly 


Asn 


Asn 


Ala 


Ala 


Ala 


Glu 


Ser 


Arg 


Lys 


400 


305 






310 










315 










320 


403 


Gly Gin Glu 


Arg 


Phe 


Asn 


Arg 


Trp 


Phe 


Leu 


Thr 


Gly 


Met 


Thr 


Val 


Ala 


404 






325 










330 










335 




407 


Gly Val Val 


Leu 


Leu 


Gly 


Ser 


Leu 


Phe 


Ser 


Arg 


Lys 










408 




340 










345 
















411 


<210> SEQ ID NO 


: 15 
























413 


<211> LENGTH: 236 
























415 


<212> TYPE: 


PRT 


























417 


<213> ORGANISM: 


Artificial Sequence 
















421 


<220> FEATURE: 
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423 


<223> OTHER 


INFORMATION: 


: TolA-BCL fusion protein after thrombin 


425 


<400> SEQUENCE: 


15 
























427 


m ir Car Z^y*<"t 

uiy ocx. x-ix y 


ft u 


Ser 


Gin 


Ser 


Asn 


Ara 


Glu 


Leu 


Val 


Val 


Asp 


Phe 


Leu 


428 


i 
x 




-> 










10 










X -D 




41 1 


Co>* TrrT* T ire 

oexr lyr iiys 


T 

Leu 


OCX 




XJjr O 


V7 X y 


xyx 


Ser 


xip 


Ser 


Gin 




OCX 




*± -j 




20 


























*x O J 


val <alU \jJ-U 


Asn 




Thr 


Glu 


Ala 


Pro 


Glu 


Glv 


Thr 


Glu 


Ser 


Glu 


Met 


436 


1 R 




























4 7 Q 


urlU llii rlO 


Ser 


Ai a 


lie* 

x xc 


Asn 


ox y 


Asn 


Pro 


Ser 


xxp 


His 


Leu 


Ala 


Asp 


440 


o U 


















o \j 










441 


ber Fro Aia 


Va1 
val 


noil 


uiy 


ai * 


X XIX 


ai a 


nib 


OCX 


OCX 


ocx 


T.^i i 

XJC LX 


a cn 
nop 


ai a 

AX CI 


444 


D D 






/ u 




















ftfl 
O \J 


44 *7 


Arg Cjiu vai 


lie 




Met- 
ric L» 


ai ^ 


ai * 


V^l 

V ClX 


xjy o 


V7X11 


ai a 


XJC LX 


R YTT 


VJX LX 


ai p 

/ix d 


44 ft 
















on 














ART 
ffcD 1 


Gly Asp Glu 


Fne 




T 

Leu 


Arg 


Tyr 


Arg 


Arg 


an ^ 


XrXIC 


Oar 

OCX 


i-io LJ 


T .01 1 
XjC IX 


X 1 IX 






i nn 










iuo 










I 1 A 

I I u 






ACQ 
ft 3 D 


ber vj±n Lieu 


TJ J — 


Tip 

lie 


X IXX 


Dm 
c X LJ 


vj x y 


Th-r 

X J.1X 


ai a 

nx a 


iyx 


Gin 


OCX 


rue 


OX LX 


Gin 


*± J o 


1 1 c 
11j 










1Z U 










IOC 
1Z 3 








ft D i7 


vai vai Asn 


blU 


XjC lx 


rile 


^-ix y 




m v 

uiy 


vax 


a qti 


lip 


Gly 




lie 


Val 


*± D U 


130 








lie; 
1 j j 










X*r VJ 










4^1 
ft D J 


Ala Phe Phe 


Ser 


file 


uiy 


oly 


ai a 


XjCU. 


vyb 


V ClX 


vJX IX 


OCX 


vax 


a C!T*V 


xiy 0 


4fi4 


145 






1 en 
1 O U 




















X 0 u 


4^7 
ft D / 


Glu Met Gin 


Val 


Leu 


vai 


Ser 


Arg 


lie 


ai a 


ai ^ 

nia 


irp 


Mpf 


ai a 

nxa 


Th-r 

X XXX 


iyi 


4fi ft 
rt O O 






165 










1 70 

X / <J 










175 




471 
*± / X 


Leu Asn Asp 


His 


Leu 


Glu 


Pro 


Trp 


He 


bill 


ulU 


/loll 


vj! y 




irp 


no LJ 


479 
ft / Z 




180 










185 










190 






475 


Thr Phe Val 


Glu 


Leu 


Tyr 


Gly 


Asn 


Asn 


Ala 


Ala 


Ala 


Glu 


Ser 


Arg 


Lys 


47 £ 


195 










200 










205 








479 


Gly Gin Glu 


Arg 


Phe 


Asn 


Arg 


Trp 


Phe 


Leu 


Thr 


Gly 


Met 


Thr 


Val 


Ala 




210 








215 










220 










483 


Gly Val Val 


Leu 


Leu 


Gly 


Ser 


Leu 


Phe 


Ser 


Arg 


Lys 










484 


225 






230 










235 












487 


<210> SEQ ID NO 


: 16 
























489 


<211> LENGTH: 115 
























491 


<212> TYPE: 


PRT 


























493 


<213> ORGANISM: 


Artificial Sequence 
















497 


<22 0> FEATURE: 


























499 


<223> OTHER 


INFORMATION 


: Tagged TolAlII 


region of pTol vectors 


501 


<22 0> FEATURE: 


























503 


<221> NAME/KEY: 


MIS C_F EATUR E 


















505 


<222> LOCATION: 


(107) . . 


(111) 


















507 


<223> OTHER 


INFORMATION 


: Xaa residues represent 


cleavage sites : 



(SEQ ID NO: 3) 



LVPR 



508 (SEQ ID NO: 4; no Xaa at position 111) or IEGR (SEQ ID NO: 5; no 

509 Xaa at position 111) 
513 <400> SEQUENCE: 16 

515 Met His His His His His His Ser Ser Asn Asn Gly Ala Ser Gly Ala 

516 15 10 15 

519 Asp He Asn Asn Tyr Ala Gly Gin He Lys Ser Ala He Glu Ser Lys 

520 20 25 30 
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RAW SEQUENCE LISTING ERROR SUMMARY 
PATENT APPLICATION: US/10/501, 071 



DATE: 07/16/2004 
TIME: 16:37:41 



Input Set : A:\Sequence Listing.ST25.txt 
Output Set: N:\CRF4\07162004\J501071.raw 



Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 

Seq#:16; Xaa Pos . 107,108,109,110,111 

Seq#:22; Xaa Pos. 14,15 

Seq#:23; Xaa Pos. 13,14 

Seq#:24; Xaa Pos. 13,14 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/10/501,071 



DATE: 07/16/2004 
TIME: 16:37:41 



Input Set : A:\Sequence Listing.ST25.txt 
Output Set: N:\CRF4\07162004\J501071.raw 



L:17 M:270 C: Current Application Number differs, Replaced Current Application No 

L:17 M:271 C: Current Filing Date differs, Replaced Current Filing Date 

L:539 M:341 W: (46) "n M or "Xaa" used, for SEQ ID#:16 after pos.:96 

L:667 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:22 after pos . : 0 

L:701 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:23 after pos . : 0 

L:731 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:24 after pos . : 0 
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STATISTICS SUMMARY DATE: 07/16/2004 

PATENT APPLICATION: US/10/501,071 TIME: 16:37:41 



Input Set : A:\Sequence Listing.ST25.txt 
Output Set: N:\CRF4\07162004\J501071.raw 

Application Serial Number: US/10/501,071 
Alpha or Numeric or Xml: Numeric 
Application Class: 

Application File Date: 07-09-2004 
Art Unit: PCT 

Software Application: PatentIN3.1 

Total Number of Sequences: 61 

Total Nucleotides: 1032 

Total Amino Acids: 262 9 

Number of Errors: 0 

Number of Warnings: 4 

Number of Corrections: 2 

MESSAGE SUMMARY 

270 C: 1 (Current Application Number differs) 

271 C: 1 (Current Filing Date differs) 
341 W: 4 ((46) M n" or "Xaa" used) 
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